339 results found.
Corpus,
Language Type:
Monolingual
Languages:
Japanese
Availability:
Freely Available
License:
Creative Commons Attribution 4.0 International License
Size:
None
Production Status:
Existing-used
Use:
- Paper title: An Empirical Comparison of Unsupervised Constituency Parsing Methods
- Paper track: Short/Syntax: Tagging, Chunking and Parsing
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jun Li | The Keyaki Treebank | /N |
Documentation:
http://www.compling.jp/keyaki/
Written
Corpus,
Language Type:
Monolingual
Languages:
Japanese
Availability:
From Owner
License:
Size:
1,830,000 entries
Production Status:
Existing-used
Use:
Summarisation
- Paper title: Improving Truthfulness of Headline Generation
- Paper track: Long/Summarization
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kazuki Matsumaru | JNC & JAMUL | /N |
Documentation:
None
Written
Treebank,
Language Type:
Multilingual
Languages:
Chinese English French German Italian Japanese Russian Spanish
Availability:
Freely Available
License:
CreativeCommons
Size:
None
Production Status:
Existing-used
Use:
Parsing and Tagging
- Paper title: Why Overfitting Isn't Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries
- Paper track: Short/Machine Learning for NLP
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mozhi Zhang | Universal Dependencies | /N |
Documentation:
None
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
Chinese English French German Italian Japanese Russian Spanish
Availability:
From NIST
License:
Size:
None
Production Status:
Existing-used
Use:
Document Classification, Text categorisation
- Paper title: Why Overfitting Isn't Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries
- Paper track: Short/Machine Learning for NLP
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mozhi Zhang | Reuters RCV1/RCV2 Multilingual Corpus | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
Japanese
Availability:
From Owner
License:
In preparation
Size:
80 GByte
Production Status:
Newly created-finished
Use:
Dialogue
- Paper title: Neural Spoken-Response Generation Using Prosodic and Linguistic Context for Conversational Systems
- Paper track: 11.1 Spoken dialog systems/Poster Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Akinori Ito | SMOC | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
Japanese
Availability:
Freely Available
License:
CC-BY-SA 4.0
Size:
3.3 GByte
Production Status:
Existing-used
Use:
Speech Synthesis
- Paper title: Cross-lingual Speaker Adaptation using Domain Adaptation and Speaker Consistency Loss for Text-To-Speech Synthesis
- Paper track: 7.11 Cross-lingual and multilingual aspects in spe/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Detai Xin | JVS (Japanese versatile speech) corpus | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
Japanese
Availability:
Not Available
License:
Size:
60 minutes
Production Status:
Newly created-in progress
Use:
Machine Learning
- Paper title: Using Transposed Convolution for Articulatory-to-Acoustic Conversion from Real-Time MRI Data
- Paper track: 1.1 Models of speech production/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ryo Tanji | Japanese rtMRI dataset | /N |
Documentation:
None
Language Type:
Multilingual
Languages:
Japanese
Availability:
Not Available
License:
<Not Specified>
Size:
942
Production Status:
Newly created-finished
Use:
Discourse
- Paper title: Building a Corpus of Manually Revised Texts from Discourse Perspective
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Second Affiliation | Country |
|---|---|---|---|---|---|
| Author 1 | Ryu Iida | Tokyo Institute of Technology | JP | National Institute of Information and Communications Technology | JP |
| Author 2 | Takenobu Tokunaga | Tokyo Institute of Technology | JP | | |
| Main Contact | Ryu Iida | National Institute of Information and Communications Technology | None | | |
Documentation:
<Not Specified>
Corpus,
Language Type:
Bilingual
Languages:
English Japanese
Availability:
License:
Size:
3000000 sentences
Production Status:
Use:
- Paper title: Generating Diverse Translations with Sentence Codes
- Paper track: Short/Machine Translation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Raphael Shu | ASPEC | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
English Japanese Mandarin Chinese
Availability:
Freely Available
License:
http://lotus.kuee.kyoto-u.ac.jp/ASPEC/#agreement.html
Size:
None
Production Status:
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Bilingual Subword Segmentation for Neural Machine Translation
- Paper track: Long paper/
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Hiroyuki Deguchi | Asian Scientific Paper Excerpt Corpus | /N |
Documentation:
None




